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Abs tract 

In  this  paper  we  consider  a  resource  allocation  problem  in  a 
distributed  network,  where  processors'  speeds  can  vary  dynamically. 
The  problem  is  local  in  the  sense  that  the  maximum  number  of  users 
competing  for  a  particular  resource  at  any  time  instant  is  upper 
bounded  by  a  constant  v  and  also  at  any  time  instant  the  iiaximum  number 
of  resources  that  a  user  is  willing  to  get  is  upper  bounded  by  a 
constant  k.  Each  user's  demands  for  resources  may  change  dynamically. 
Our  solutions  to  the  problem  are  real  time  since  our  algorithm's 
response  time  is  upper  bounded  by  a  function  which  does  not  depend  on 
any  global  measure  of  the  system  of  processes  and  resources,  except  k 
and   V. 

We  improve  here  previous  real  time  solutions  given  to  the  problem 
in  [Reif,  Spirakis,  1932B],  which  had  response  time  of  O(v^).  In  this 
paper  we  provide  probabilistic  algorithns  whose  response  time  is 
po ly n ami al  to  v  and  k=  Our  most  efficient  algorithm  has  response  time 
0(vk). 

Unlike  the  probabilistic  algorithms  of  [Reif,  Spirakis,  1982B],  we 
do  not  need  to  use  random  waits  as  means  of  avoidance  of  process 
starvation  and  of  achieving  probabilistic  fairness.  Instead,  our 
algorithm  employs  a  new  method  of  probabilistic  bidding,  to  resolve 
contention  of  users  for  resources.  This  technique  is  essential  in 
achieving   polynanial   response   time. 

We  furthermore  prove  that  our  solution  is  optimal  with  respect  to 
the  average  response  to  a  user's  request.  In  particular  we  provide 
matching  lower  bounds  for  any  distributed  algorithm,  and  these  bounds 
are  within  a  constant  factor  of  the  response  time  of  our  own 
algorithms.  We  also  provide  a  suboptimal  algorithm  which  is  useful  for 
improving  the  throughput  rate  of  resource  allocations  in  systems  where 
the  majority  of  users  is  willing  to  get  only  a  few  resources.  This 
suboptimal  algorithm  does  use  random  waits  combined  with  probabilistic 
biddii^. 

Our  techniques  employ  limited  parallelism  within  each  process, 
together  with  the  probabilistic  bidding.  (This  limited  parallelism  is 
useful  in  achieving  optimal  response  time,  though  we  would  still  get 
polynomial   response   time  without   limited   parallelism). 


-2- 

1.      Introduction 
1.1      Resource  Granting   Systems 

In  this  paper,  we  consider  a  resource  allocation  problem  similar 
to  that  described  in  [Lynch,  1980]  and  [Reif,  Spirakis,  1982B].  There 
are  potentially  an  infinite  number  of  processes  in  the  system,  and  each 
has  an  integer  name.  Let  the  set  of  processes  be  called  11.  There  is  a 
potentially  infinite  set  of  resources,  p,  in  the  systsem.  Let  RSn  be 
the  set  of  proceses  which  control  re^'oiirces  Let  V  (the  set  of  user's 
processes)  be  II -R.  Each  resource  p(j)<r  p  is  controlled  by  a  distinct 
granting  process  j.  This  process  has  the  r'^spoasibili  ty  of  granting  its 
resource  to  a  distinct  xxsf.x  proc^,ss  in-  U,  i,e.  no  two  different 
processes  in  U  may  use  th-s  satn'^  resource  at  ths  same  time.  We  assume 
user  processes  communicate  on?.y  wi'h  thot>''  ^r^ntlng  processes  for  which 
they  request    resources. 

A  system  as  above  I  rj  called  Resmrce  Granting  System  (RGS);  see 
[Reif,  Spirakis,  1982B].  Tlie  goal  of  an  RGS  is  to  satisfy  dynamically 
changing  user  requests  for  resovroe  allocation,  in  a  distributed  way, 
by  only  a  local  communication  befeen  granting  and  requesting 
processes.  We  assume  that  there  is  a  global  time  t  totally  ordering 
events,    but   processes   may   not   have  access   to   the    global   time. 

We  now  describe  the  adverse  oracle  A  which  has  the  power  to  set 
acti;i-s  in  the  worst  possible  way  to  increase  the  response  time.  A  has 
also  the  capability  to  select  at  time  t=0  the  schedule  of  the  speeds  of 
all  prot.  ^ses  at  all  times  t>  0  .  In  addition,  at  each  time  t  >  0,  the 
requests  i  user  processes  U  are  specified  by  A.  The  adverse  oracle  A 
is  restrict  ;v.,  to  allow  users  to  keep  asking  for  their  resources  until 
they  are  grant-  ,.  in  practice,  no  such  A  may  exist  but  coincidence  of 
worst  case   situt.    ^'ons    may   replace   its    action. 

A  process  stc;  consists  of  either  an  assignment  of  a  variable,  a 
test,  a  logical  or  arithmetic  operator  or  a  no-op.  A  step  is 
considered  to  be  a  finite  time  interval  A  in  which  a  single  primitive 
instruction  is  executed  just  at  the  last  moment  of  A ,  and  no  other 
instructions  are  executed  within  A.  Let  a  process  be  tame  during  a 
time  interval  A ,  if  for  any  interval  A '  which  intersects  A  and  is  a 
single   step  of    the  process,    then   ]a  '  ]         [r^in,r^x]   where   r^in.r^x  ^re 
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fixed  real  constants   wi  th  0     <      r^^^j     <       %ax*        ^^      will     not      require 

processes      to      be      tame     at      all     times.        However     our     proof  that   our 

algorithms   are   real   time  makes  an  assumption   that   processes      are      tame. 

We      assume   that   processes   are   reliable   in  the   sense   that   they  perfectly 

execute  their   programs.      The  rate   of  execution   varies    dynamically.        We 

require     that,      at     no      time,      arty    granting   process   i     R  simultaneously 

grant   the   resource    p(i)   to  more    than  one   requesting   process.        We     also 

require      that,      as     soon      as      a     jirocess      j      U  has   got   all   its    required 

resources,    then   it    can  keep   thenf  only  for  a   time   interval   whose      length 

is      upper      bounded      by  a  fixed    parameter  6    (containing  at  most   an  upper 

bounded  number  of   its-^teps,   y    =  iS /^jn^j^,    if    the   process    is    tame).         Let 

resources (.(i)      be     the   set    of   resoufdes   that    process   i  is    requesting  at 

time   instant   t.   Let  k^   j.   be'^' lr€!sourdaif.(i)  ] .      Let   askerSj.(j)  be  the   set 

of      user      processes    '  retjiies^i'n^^    W5'5      for     j     R  at   time   t.   Let  v-   ^   be 

]askers  .(t)  ].      We   assume  that   at   all   times  t   >    0,      '^ ^    t      ^""^     ^   t      ^^® 

upper  bounded  by  constanta_v/lr!  respectively  and    that   k  <    v.      (This   does 

not   necessarily      imply     arty     -bounds  '"''on  ^  t    ^        resources ^(i)  I      or      [    U 

t=0  "-  t=0 

askers^(j)]    for  any  i   in  V 'ot  j    in  R. 

With   respect   to   interprocess   communication  we   assume    (1)    that   each 

resource  allocator   j     R  has  a   set   S-  available   to   it   of   size   at   most   v, 

containing      the      names      of   those   processes   willing   to  get  the   resource. 

As    in  [Lynch,     1980]   we   assume   this   to  be  a   primitive  of   our   system      (it 

could     be   implemented  by  a   queued  message  system,    for  example)    (2)    that 

synchronization   must   be   done   by  special  variables,    called      flags ,      each 

of     which      is   written  only  by  one   process   and   read  by  at  most   one   other 

process.      Read-write  conflicts   on  flags   are      excluded      by      our      process 

step     semantics  and  by  our   notion  of   global   time.      Flags    seem  to  be   the 

simplest      primitives      for     synchronization     and        lead        to        an        easy 

implementation      (in     contrast      to  e.g.      distributed    shared  variables    of 

multiple  readers   and   writers). 

1.2    Implementations   and   Complexity   of  an  RGS. 

An     implementation     of     an     RGS     determines      the        synchronization 

algorithms        that        the        processes        run.  As        stated        above,      the 

synchronization     algorithms        use        only        flags        to        implement        the 

synchronization        between       processes.  We     consider     a      time-varying 


hypergraph  H^.  vd.  th  node  set  11  and  time-varying  hyperedge  set  E^  =  {  [i] 
union  resource^ (i )  :  i€U}  ,  i.e.  where  a  hyperedge  at  time  instant  t  is 
the  set  of  nodes  of  11  consisting  of  a  single  process  i  in  U  and  the 
granting  processes  of  the  resources  i  is  willing  to  get  at  t.  An  RGS 
implementation  dynamically  achieves  distributed  matchings  in  the 
hypeigraph  H^.. 

For  each  adverse  oracle  A>  let  the  response  time  of  the  RGS 
implementation  be  the  random  variable  Y^  )^  which  is  the  length  of  the 
smallest  interval  A  required  for  any  process  iCU  to  have  k  resource 
requests  simultaneously  granted  ^  gi\an  that  i  requested  these  resources 
during  the  entire  interval  A  and  ^ssuning  that,  i  and  all  allocators  of 
the   resources    requested  bj   i   withiri  A,    are   tame   vrithinA. 

Let  the  mean  regponse  J ^  be  th£  ii^x{  mean  {y^  y.}  over  all 
oracles  A}  .  let  the  c  -fcrror  j.£Gpoi.v.e  be  c'.ie  idninum  y^(e  )  such  that 
for   every    oracle  A 


The  RGS  implementation  is  _real  Liae_  if   v^ '■J  (0,1]  ,  Y^(e  )    >0     and     upper 

bounded   by    a   function   independent   ol   any  global  measure  of    the  network. 

(Note:    A  global   measure  of  the  network  is   any  positive      function     g      of 

h  =     In  I    such   that   lim      .    .    =  0).     Note  that   if   an  RGS    in^lementation   is 

hvoo    g(h) 
real    time,    then   the   mean      response     fu      is     also     upper     bounded      by     a 

function   independent    of   any   global  measure  of   the  network. 


1.3    Previous   work 

[Rabin,  1980a]  first        applied        probabilistic        choice        to 

synchronization  problems  in  distributed  systems  and  provided  a  solution 
to  the  dining  philosophers  problem  which,  with  probability  1,  is 
deadlock     free        and        starvation        free.  [Rabin,         1980b]        applied 

probabilistic  coordination  methods  to  synchronize  access  of  processes 
to  a  critical   resource   in  a    space -efficient   manner. 

[Lynch,  1980]  first  posed  the  localized  resource  allocation 
problem  as  a  formal  synchronization  problem.  Let  the  resource  graph  G 
be    the   graph   whose  nodes      are      the      resources      and      two     resources      are 


connected  by  an  edge  if  there  is  ever  a  user  process  requesting  both  of 
them,  maybe  at  different  times.  Let  x(G)  be  the  chromatic  number  of  G. 
The  implementation  proposed  by  Lynch  was  a  deterministic  one  in  which 
processes  should  know  the  color  of  each  resource  in  a  coloration  of  G. 
The  response  time  achieved  in  [Lynch,  1980]  is  of  the  order  of 
x(G)v^^^^«T  where  t  is  the  time  necessary  for  interprocess 
comiainication.  This  was  not  a  .real  time  implementation  since  x(G) 
could   be  0  (  |n  ]  )   in   general. 

[Reif,  Spirakis,  1982B  ,  ]  provided  the  first  real  time 
probabilistic  RGS  implementation  of  Y^(e  )  =  0(kv^log(_)logX.)  and  f  =  0 
(kv  logv).  In  that  previovts  ^per,  we  used  the  techniques  of 
probabilistic  selection  of  processes  byresource  allocators  and  random 
waits  to  avoid  adverse  Siphed^iies  _-Oif  sp~eeds  which  might  be  set  up  by  the 
oracle.  Although  thi-s  weiio^  ^iti^  ?_time  Implementation,  it  is  still 
exponential   in  k. 

1,4   The  new  results   of   this   paper. 

We  shall  present  in  section  II  a  probabilistic  implementation  of 
an  RGS,  with  mean  rsponse  f^  ^  0(kv)  and  Yi<^(e  )  =  O(kvlog(— )).  To 
achieve  this  response,  wc  -niake  essential  use  of  a  new  probabilistic 
bidding  technique,  together  with  use  of  limited  parallelism  within  each 
user  process  and  each  resource  allocator.  In  our  primary  algorithm  we 
do  not  use  random  waits  to  achieve  probabilistic  fairness.  Instead  we 
use  only  the  new  probabilistic  bidding  technique.  In  particular,  we 
slice  time  of  each  process  into  rounds.  In  each  round  each  user 
process  tries  to  get  all  the  wanted  resources.  It  has  to  get  all  of 
them  in  the  same  round.  The  users  deny  the  allocation  of  resources  to 
them,  unless  all  the  required  resources  are  offered  to  be  allocated  in 
a  small  number  of  steps.  At  the  end  of  the  round,  users  release  their 
allocated  resources  (if  any)  and  make  a  fresh  start.  User  rounds  have 
the  same  length  in  steps  for  all  users  and  this  length  is  a  parameter 
of  the  algorithm.  In  contrast,  resource  rounds  are  not  of  the  same 
length  and  their  length  in  steps  is  not  fixed  in  advance,  but  adjusts 
to  the  conditions  of  the  algorithm.  We  conjecture  that  this  is 
essential    in  avoiding   exponential   growth   of    the   response   with  k. 

We   also   prove    lower  bounds    of  f2  (kv)    for    the  worst   case   and   average 


response  time  of  any  algorithm  for  the  stated  local  resource  allocation 
problem,  thus  cxir  proposed  technique  is  of  optimal  performance  within  a 
constant   factor. 

Finally,  we  provide  a  non-optimal  algorithm  which  is  of  polynomial 
mean  response  time  in  k  and  is  useful  for  improving  the  throughput  rate 
of  resource  allocations  in  the  nrtworkr  In  particular,  it  allows  user 
processes  which  demand  less  tVian  k  resources  to  have  higher  probability 
of  being  assigned.  This  algorithm  has  the  p-'^perty  that  if  a  user  i  U 
has  a  request  of  at  most  k.  <  k  resourct^s^  then  it  has  mean  response 
Y,^_    =  0(vk^' lo^«  log(k^v)). 


1  ,  "^  Applicatipnc 

Example  J_:    k-fork  Phi  Iog  op  he  ra    ,-       ,■  ij-fr  i-ri      •.   -  ' 

As  a  simple  exampl.  of  e  i  R^^S  v^  ccrridGr  i  generalization  of  the 
dining  philosophers  problem  to  the  case  --.'here  each  philosopher  requires 
k  forks  to  eat.  Note  that  we  can  extend  this  to  the  case  where  the 
identities  of  the  forks  lequixrd  by  etch  philosopher  change 
dynamically.  Let  the  set  cf  "forks"  t^^  R  -  {r^....,r  }  and  the  set  of 
"philosophers"  be  U  =    {uj^,.,.,u^}    and   let 

resources^(ui)    =   ^^i '^(i+l)modn' •  •  "  ^(i+k-l)modn> 
and 


askers^Cr^)    =   {ui_k+l)n,odn.  •  •  •.'^(i-l)modn' ^i^    ' 


for  all   t.    Our  new  resource  allocation  algorithm  achieves   mean  response 
time     Y^^     =      0(k    ).         In      contrast,      our  previous   results  achieved  mean 


Example   2_     Two-phase   locking   in  databases 

Two-phase   locking    is  a  concurrency  control      method      in     databases; 

for     a      survey  see   [Bernstein,   Goodman,    1980].      It   has   the   feature   that 

as   soon   as   a  transaction  releases   a    lock,    it  never     obtains      additional 

locks.      A  veiy   efficient   static    implementation  of   two  phase    locking   can 
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be  achieved  by  our  methods.  Our  assumption  is  that  transactions  are 
allowed  to  act  on  the  data  only  if  they  got  all  the  locks  requested. 
In  the  context  of  such  a  database  system,  let  the  users  In  U  be  called 
transaction  modules  and  the  processes  of  R  be  called  data  modules.  If 
the  readsets  of  the  transactions  are  of  cardinality  at  most  k  at  each 
time  instant  and  if  at  most  v  transactions  can  compete  for  a  lock  at  a 
time  instant  t,  then  our  optimal  RGS  will  result  in  an  0(vk)  mean 
response  time  per  transaction.  Our  suboptimal  RGS  achieves  an  even 
smaller  mean  response  time  when  Ireadset^(t)  ]  =  o(k).  In  this  case,  if 
a  transaction  wants  to  lock  k.  data  items  at  a  time,  it  has  a  mean 
response  of  0(vk^  log(vk^)lo^)  time.  (However  this  becomes  0(vk 
log(vk)lo^)  when  Ireadset^(t)  1  =  k) .  Our  implementations  of  two  phase 
locking  proposed  in  this  paper^are'-'asymptotically  much  more  efficient 
than  the  static  locking  method  proposed  in  [Reif,  Spirakis,  1982B], 
which  had  a   mean  rfespons-^igfae'  eCnc-^^"^--^/-      •■  - 


2.  Our  distrtbuted'^profeabilistic  bidding  algorithm. 
We  assume  that  the  reqiiescing  processes  communicate  only  to  the 
resource  allocators  whose  resources  they  want  (or  have  been  allocated), 
and  that  granting  processes  j  a  re  willing  to  communicate  only  to  the 
requesting  processes  in  the  set  S.  (as  defined  in  Section  1.1).  The 
actions  of  the  requesting  and  granting  processes  are  time-sliced  in 
rounds ,  each  round  being  a  repetition  of  a  basic  set  of  actions. 
Processes  use  indeperdent  sequences  of  probabilistic  choices  as  the 
basic  counteract  to  adverse  speed  schedules  and  adverse  resource 
demands  set  up  by  the  oracle  A.  We  assume  that  A  cannot  affect  or 
foresee  the  results  of  these  probabilistic  choices.  We  allow  each  user 
in  V  and  each  resource  alloctor  in  R  to  have  a  set  of  synchronous 
parallel  subprocesses,  which  aid  in  our  algorithms.  The  use  of  local 
parallelism  here  is  not  actually  essential  in  achievement  of  polynomial 
response    time. 


2.1  An  informal  description  of  the  rounds, 
(a)  The  user's  round 
A  user's  round  starts  with  the  user  drawing  (with  equal 
probability)  a  random  number  in  the  set  {l,2,...3kv}  where  g  >  1  is  an 
integer.  If  the  number  drawn  was  less  than  kv,  the  user  remains 
nonactive,  until  the  end  of  the  rourdr..,  (All  users'  rounds  take  a 
predetermined  number  of  stcps^.  Else,  the  user  immediately  notifies 
(by  the  use  of  at  most  k  parallel  sjTichronou.';  subprocesses )  all  the 
resource  allocators  of  the  r;:source?  he  'vants,  that  he  is  a  winner. 
Then,  the  user's  parallel  r.ubprocesses  collect  answers  from  the 
resources  for  a  period  which  it  bounded  by  a  constant  number  of  steps. 
During  this  period  some  of  th3  resourcs?  nay  declare  that  they  agree  to 
be  allocated  to  the  particular  user.  However,  if  at  that  time,  any 
other  resource  requested  by  nhat  user,  is-dei/.tic  ,  *-ben  that  user  does 
not  utilize  the  resour-:3S  viij-h  agreed-  tc  '.£  -.llocated  to  him,  but  he 
continues  to  report  that  he  is  a  winner  to  all  of  his  requested 
resources  and  repeats  the  slgoriuVi  (witVcut  drawing  again),  until  the 
user's  round  ends.  If  c.l?.  of  th..  wanted  resources  agree  to  be 
allocated  at  the  same  time,  the  usei  r-tiiizes  then  for  y  steps  and  then 
he  releases  these  resources.  This  is  done  In  parallel,  by  explicitly 
notifying  the  granting  processes  of  the  release,  using  his  k 
subprocesses.  Note  that  a  communication  with  all  the  k  resource 
allocators  takes  only  r^g^  ^  ^™^  ^^^  ^°  ^^^  limited  parallelism  and 
tameness    of   processes. 

(b)  The  resource  allocator's  round. 
A  resource  allocator's  (j)  round  starts  with  a  monitoring  period 
(of  a  constant  number  of  steps)  during  which  at  most  v  parallel 
synchronous  subprocesses  continuously  monitor  the  users  of  the  set  S., 
looking  for  winners.  Let  M.  be  the  set  of  winners  detected  during  the 
monitoring  period.  If  M  .  contains  more  than  one  winner,  then  all  the 
elements  of  M.  are  notified  (in  parallel)  that  they  have  been  denied, 
and  the  round  ends.  However,  if  M.  has  a  unique  winner,  then  the 
granting  process  notifies  the  winner  that  it  agrees  to  be  allocated. 
If  the  winner  does  not  accept  the  agreement  then  the  round  ends.  If 
the  winner   accepts,    then   the   round  enters  an  allocation  period.      During 


this  period,  the  parallel  subprocesses  of  the  resource  allocator  deny 
all  appearing  winners.  The  round  now  ends  by  receipt  of  the 
notification  by  the   user    that   the   resource  has    been   released. 

Note  that  communication  with  all  v  of  the  user  processes  and  all 
set  operations  in  a  resource  allocator's  round  take  only  a  constant 
(Independent  of  v  or  k)  number  of  steps  due  to  the  parallelism 
employed.      Note  also   that   the   following   hold  with   certainty: 

A  resource  decides  to  be  allocated  to  a  unique  winner,  only  after 
the  resource  allocator  agrees-  to  allocate  the  resource  and  the  winner 
accepts  the  agreement.  Thus,-  no  resource  can  be  allocated  to  more  than 
one  user  at   the   same   time/'ijy  Our '  bidding   algorithm. 

For  a    detailed   descilptian  of  tfre  algorithm,    see  appendix   I. 

2.2;  H&p^titi&s   of- fhe   blading   algorithm. 
As  sural  iTg   all  procebisssi  trd«6'^ -^ra^sHfe'll -prove   that 

Proposition  j^.  If  a  partlcuiaf  use'rti  is  a  winner.  in  its  current 
round,  then  its  k  parallel' syn  char  onous  subprocesses  will,  at  least  once 
in  its  current  round,  report  at  Che  same  time  that  1  is  a  winner  when 
all  requested   resources   are   in  a   monitoring   period. 

Proposition  2^.  Given  that,  a  particular  user  1  is  a  winner  in  its 
current  round,  the  probability  that  he  stays  a  unique  winner  for  all 
his  wanted  resources  during  the  whole  round,  is  lower  bounded  by  a 
constant,    independent    of  k    or  v. 

Let  a  draw  by  a  user  be  a  random  independent  selection  of  one  of 
the   numbers    { 1,  2  ,  .  . .  ,Bkv  },    P    >    1. 


Proposition     3^.         During      a      round   of   user    1,    any   other    distinct   user    j 
competing    for   the   same   resource   cannot   draw  for  more    than  times. 


These   propositions   will  help   us   prove 


Theorem   1.      The  probability  that   a   user   is      allocated      all     his      wanted 

resources      in      his      current  round      is     upper     bounded  by  —, —  and   lower 

1  gkv   r 

bounded   by  wherer    d   >   1      is     a      constant,      depending      on     and 

dkv  r^^^  ^ 
independent    of  k    or  v. 
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Theorem  2.  Y^(£  )  =  0(kv  log(— ) )  and  f^  =  0(k.v)  for  the  bidding 
algorithm. 

For  proofs  see  Appendix   II. 

A  corollary  of  theorems  1  and  2  is  that  our  algorithms  do  not 
deadlock,  no  process  starves  (these  hold  with  probability  1)  and  our 
algorithms   are   probabilistically  fair. 


3.    An  J2(kv)    lower   bound   for   the   lcc3.?.    resource  allocation   problem. 

Theorem     3_.        For     k   >   0.    Uherci   is   a  network   in  which  at    least    one   user 
process  has   to  have   a   reppon^e   tiiQR   '^f   '  ■:    least    (kv-l)u    steps. 

Proof  o'.'I  ncJ. .' 

Consider  a  networ1<  wf.r.h  a  cOJ:  of  -ec  vcas  ?v  such  that  JR]  =  k  and 
a  set  of  users  U  such  tha'-  [u]  ■•-  kVc  L  *:  A  be  an  oracle  such  that  all 
processes  are  equi speed,  synchronous  and  such  that  Vt  >  0  ,  Vj  U, 
|resourceSj^(  j)  1  >  (k/2)  +  1„  '?iieii,  only  one  user  process  can  be  granted 
all  its  resources  at  each  time  in3':an-  t  ,  To  r.ee  this,  assume  for  sake 
of  contradiction  that  at  Isasu  two  user?  j]^.J2  are  granted  all  their 
resources  at  time  t.  Each  resource  h?.s  to  be  granted  to  only  one  user 
at  a  time,  thus  forming  a  bipartite  matching  of  the  hypergraph  H^.,  as 
defined  in  Section  1.2.  Hence  we  conclude  that  the  number  of  allocated 
resources   at   time   t   is 

]resourceS|.(  jj)  ]   +   ]  resourceSj.(  J2)  I   >   k+2   >k        for  all   j  >    0. 

This      implies      that   resources  will  be  allocated    to   processors   serially, 

hence   the    last   process   of   the  serial  order  will  have  a    response  time  of 

at       least      (kv-l)vi      steps.  The     above     holds      independently      of      the 
synchronization   technique. 

Corollary.      Our     probabilistic      bidding      algorithm     of      Section      2     has 
optimal  mean  response   within  a    constant   factor. 


Proof.  By  Theorem  3  and  by  the  fact  that  given  any  imiltiset  of  serial 
orders  of  kv  elements,  there  is  at  least  one  element  whose  average 
position  (over   the   nultlset   of   orders)  is  at    least    [kv/2]. 


4.    M   alternative  distributed   resource  allocation  algorithm 

which   improves   the   throughput 

4.1   Motivation 

Theorem     3      provided      loweY  bounds    for   systems  which  are   saturated 

with  requests.      In   practice,   systems  will  not   continuously  have  so  many 

requests      at  all   times.      This  section  prt:(vides   an  algorithm  which   gives 

good  response   time   in   the   cate   in  which -the   system     is     not      saturated, 

(though      it      is      not   asymptotically  efficient   for   saturated   systems,    as 

the  algorithm  of    Section  II)o 

4.2  Our  alternak-..a^:!i§tribQ ted 'allocation  algorithm 
The  user  process's  rbu5i53s'  arfe  ttie  sama  "ks  in  Section  2.1,  with  one 
addition:  Each  round  now  starts  with. 'the  user  waiting  (nonactive)  for  a 
randomly  chosen  number  of  step'Sv'Qnif o^  in  an  interval  upper  bounded 
by  a  suitable  constant' Ci,  chcstn  In  such  a  way  that  Cj^*  r^^^^j^  is  greater 
than  the  maximum  possible  duration  of  the  useful  part  of  the  round. 
The      resource      allocator's      rounds      are      split      into      a     series    of    logk 

m 
j      for     which      k.e[_!^   ,  ^ 

proceeds   to    the   next    interval  A^.i   only    if      all     user     processes      which 

r     k  k  -I 

demand      k-      resources,      k^      in       ,     —  ,      have   been  allocated   their 

J  '        J  >■  2m+r      2"* 

resources.      Within  each   interval,   the   resource   allocators    go  through     a 

sequence   of   rounds,    as   described    in  Section   2.1, 

4.3      Probabilistic  Analysis 

Let     us      consider      a      time      interval     A„.      Let   u'    be   the  number  of 

m 
r     k  k 

rounds   required    for  user   i   with  k.    in  [ ,  — J  ,    and  also   for  all     (at 

2m+l      2™ 
most  k.v)  users    competing  with  user   i,    to  have  all   resources   allocated. 

Let  u    be  the  number  of   rounds    (within   interval      ra)      required      just      for 

user     i.   Let  u(e  '  )   be   such   that  Prob{u  <    u(e  '  ) }    >    1    -  e  ' .      Then  Prob{u' 

<   u(e')}  >    (1-e')   i^,    because  users    take   independent   actions   and   due    to 

the   random  waits.      Let   us   set  e  '    =e/k,-v. 
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Then,   we  get 


ProMu'   <   u(-i-))  >    (1    -  -L_)^i''  >    1    -e 
k.v  k.v 


Hence,    thee-error   response   for   all  users    in   the   interval  A    ,    is 


Y^Ce)    =0(h^v    log    :..f-)) 


k  - 

where      h„     =     —  ,    leading    to  a   mean  response  y-L.      =  0(h_v  log(h_v))   for 

^2^  "m  ™  ™ 

the   interval  A    .      The   abc/e   deteTinine&  jjirobabilistic     upper      bounds      on 

the   e-error   length  and  Tie?n   le"pth  cf   '■.ht   •'nt^rval  A    .      Since   there   are 

r     k  k  , 

logk  intervals,    the  e-erroi    response   for   a  user  with  k.    in 


)m+l' 


will   be 

T^   (e  )    =  0[ —  V  log(— ^)log  k)     implylag  ,  -; . 
i  2™  2^ 

a     mean   response  -fv      -  o[ —  logk  log( — ]  .      For  users  with   small   demands 

"^i  2™  2^^^ 

the  above      mean      response      outperforms      the      probabilistic      bidding      of 

JL-,  —\    and  2™  =  6  (k),    we   get   a  mean 

response  y^_    =  0(v   logk  logv)    . 

Hence    we  have 


Th e ore m  4^.      Our  alternative  distributed   resource   allocation     method      of 
Section     A,      has    the    following    property:    For  users   i    with   demands  k^    in 


V/2'    \i\    ^""^  \i  ~    2~™*k,    thee-error   response 


h^v 

is  Y^^   (e  )    =  0(h  V    lo^    log( ))  and   the  mean  response   is  f^^        =     0(hjjjV 

m  E  m 

logk    log(h    v)). 
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Appendix  I^ 

A  detailed   description  of   the   probabilistic  bidding   algorithm. 

In  the  following ,        we        set        parameters       B    =    2         "^^        , 

c    =  (2(2-Hi)+l)_^if  .      (Only    for   the   subopitmal  alternative  "^" 

'^min 

algorithm  of    Section  4,    c^   =   (2(2-nj)4l)(  "'^^)2, 

The  users  use  the  followirig  flagcr  For  user  i  the  flag  W .  .  =  1  iff 
i  is  a  winner  and  is  willing  to  get  resource  p(j).  The  flag  A^.  =  1 
iff  user  i  accepts  the  allocrtion  of  the  resource  p(j).  Both  flags  are 
0  else.  The  flag  N.  .  is  usually  0^  it  becomes  1  when  user  i  releases 
resource    j. 

The  resource  allocators  j  use  the  following  flags:  E.^  =  0  if  the 
resource  is  denied  and  ^  if.  j  agrcor  ,,that  its  resource  is  allocated  to 
i.  Each  allocator  j  has  a3so  n  shirfed  (fot:  itf.  parallel  subprocesses ) 
variable  M.  which  allo^yj  ••. jr. -.ur-^eni-  "re ■:"■'_  a-.d,  incase  of  multiple 
writes  of  the  same  value^  *heir  t-'ti  is  Tecorded.  M.  is  used  to  count 
winners  during  the  monitoring  period,,  Each  user  i  uses  a  shared  (for 
all  its  parallel  subrpocesses )  variable  L^.  It  pIIows  concurrent  reads 
and      concurrent   writes    of   the   same  vrlv.ec      In    -:a3e   of   concurrent   writes 

to 

identify  situations  in  which  all  wanted  resources  have  been  proposed  to 
be  allocated    to  user   i,    at  the   same   time. 

The  counters    counter^,   counter-,    count  steps   of   i,j   in  a    round. 


The  user's    round    for  user   i. 
(Initially  W^.   =   A^  .   =   N^.   =  0  Vj   =    l,...,k   and    k^   =   1) 

s  tart    round 

choose   X  randomly  uniformly   from    {1,2,  .  . .  ,ekv}   if   x*   3kv    then  do 

c-1  no-ops;    go    to   finish 

repeat:cobegin        (comment   in  parallel   for   j   =l,...,k) 

W,.  =  1  -  ,      - 

if  E.^   =-0   then  L^   =   0  else  L^   =   1 


if   L.    =   1      AND     counter^    <  c-p      then 

begin         '-"^''-  ''''•      -  -    ^^■■-'   ■.    ■"'  "     ' 

cobegfn   ---Cc'oAffiant      All'tesources   are  allocated) 

Ajl  =«of)    b^cofiment      accept) 
:.T*  us^^g6sdiSJ»%-9'£^-j)   f d¥  v<    steps 

N^.  ^31      (cc:^i3eht   release  resource) 
coend         -^      •       -'-';    '    •"     '  * 
Nj.   =  0;    Ai.r=i.O--  • 

waiit  until  doafitefj    =-c 
go    to   finish     '■'■        ^    • 
end  -•■  -  • 

else 
begin 

cobegin 

Aj.  •   =  0  (comment   deny  allocation) 

coend 

if      counter.    <  c-u~2     then   go   to   repeat 
else 

begin 


end 

finish:    end   round 


The   resource  allocator's    round   for  allocator   j. 
(Initially  M.   =  0,  G^.   =  0,   a^^.   =  0) 
local   a^. 
start   round 

In  parallel   do  until       ccunter  .  =  \i 

if     W.  j   =   1      then     (Mj   =  Hy',-1-    a^^   =   1) 

£d 

cobegin 

if      (M.   =  0     or     ¥.  >    2)      then 


»■       ^  y           I.  1 1  ^,  11 

begin 

•ji-j  ,go,to      fir 
end 

ish 

.^-elce,    .    , 

begin.       r- 

if  Hj  =^ 

then 

begin      ; 

if  A,  .   = 

0      then  : 

begin 

■    '    '    '^1:.: 

hi- 

go  to 

0 

finii 

5h 

end 
else 
begin 

repeat    until   N.  .   =   1 
(COM>ENT:    resource  allocated 
until  notification  by  user) 
E.,   =   0 
go  to      finish 
end 
end 

else 
repeat      ^^i   "^  0  until   resource    dealloca 
end 


coend 
finish:    end  round 


Appendix   II 
II. 1   The  proof   of   the   properties    of   the   bidding   algorithm  for   the      case 
of   equispeed    processes. 

Let  us   analyze   first   the   simple  case   In   which  a   fixed   time   instant 

tQ   exists    after   which  all  processes   are  executing      at      the      same      speed 

(r   ..        =      r        ).        However,      for      t     <      tr,      the  processes   may  have  been 
ml  n  ins.  x  '^ 

asynchronous  and    so,      at     t  >    t^,      the      times      of      execution     of      their 
programs      may   be  shifted   relative ''to  each  other,   in  an  adverse   way,    due 
to   the  oracle    A.    In    this    case,   g  ■"=    2^    is   enough. 
Proof  of  Proposition  _1  ■       ^ 

Assvme  that  user"l  P^s  ju^t^  being  declared  a  winner  in  his  current 
round  because  he  had  drawn  the'"=Tiumber  3kv  in  the  probabilistic 
selection.  Within  a  step,  allsfesource  allocators  are  notified.  If 
some  resource  allocator  is  in  ^Saffil  allocating  period  at  the  time  of 
notification,  then  he  is  going  "to^nter  a  monitoring  period  by  at  most 
a  number  of  steps  equal  to  ^ha  allocating  period.  If  a  resource 
allocator  was  in  a  monttorltig  period  at  the  time  of  notification,  then 
he  will  continue  being  in  such  a  period  (since  his  resource  cannot  be 
allocated  to  another  winner,  dua  to  the  presence  of  winner  i).  So,  by 
at  most  a  number  of  steps  equal  to  an  allocating  period  from  the  time  1 
decided  that  he  is  a  winner,  all  his  requested  resources  are  going  to 
be  notified,  at  the  same  time,  within  their  monitoring  period,  that  1 
Is   a  winner. 

Proof    of  Proposition   3^ 

Since  users'  rounds  are  of  equal  time  duration  in  the  case  of 
equispeed  processes,  only  one  draw  of  another  user  is  possible  within 
user  1 's  round.  Actually,  exactly  one  draw  of  each  competing  user  will 
take  place   somewhere   within   user   i's    round. 

Proof    of  Proposition   2 

The  probability  that  user  1  remains  a  unique  winner  during  his 
current  round  for  all  his  wanted  resources,  given  that  1  is  a  winner, 
is  equal  to  the  probability  that  none  of  the  competing  users  manages  to 
be  a  winner   within   i's    round.      The  number  of   the   competing   users   is      at 


most     kv      (at     most     v  competitors  per  each  of   the   k,  resources   asked  by 

user  i)   and  each  competing  user  has    exactly    2  draws      which      could     make 

him     a     winner      during      i's     current      round    (by   virtue  of   Proposition  3 

above,    one  is   the   draw  during   the   current   i's    raind  and   another   is      the 

draw     during      tVe   previous   round.      The  probability   of  each   draw  failing 

to  win  is    1    -  ,    (since,    in  the   case   of   equispeed   processes,   g    =    2), 

2kv 
hence      the     probability   that   i    stays  a   unique  winner   during  his    current 

ind  is   at    least    (1    -  -J—)'^^^   >  -i-  ,   where  e   =  2.73    . 
2k  V  2e 


rout 


Proof    of  Theorem   l_ 

The  probability  2.^T^,A),    for  oracle   Aand  history  F^,    that   a     user 

i      is      going      to     be      allocated   all  his   wanted  resources   in  his    current 

round  starting   at  t,   never  exceeds  — —  due   to   the   fact    that   prob{user  i 

Bkv  ^ 

chooses      to  be  a   winmr    in  his    current   round    }    =  — —  .      Given   that  user 

.;   e    o     sv  v;;i      •-  •  pkv 
i   chooses   to  be  a   winner,   \{  he  remains   a  unique  winner   during      all     of 

his     current      round,      he      is    goind    to  be  allocated   all   of  his    resources 

(due   to  Proposition   1)  with  csrtainty.      Multiplying   probabilities    given 

by  Proposition  2,    we  get   that 


^^^'^^   '    2^i=   4^^ 


Hence  d   =  4e    for  equispeed   processes. 

Proof    of  Theorem  2 

Let  u  be  the  number  of   rounds   required    for  user   i  to  be    granted   all   its 

k   resources    in   some   round,    given  that   i    starts    requesting   them  at      time 

t^   and  assuming    that   processes   are  equispeed   after    tj^   and  also  assuming 

any  history  of   the   system  up  to   t^    and  any   oracle   A.   Let  round   i      start 

at   time   t^    i  <    n.      We   have   by    Baye's    formila 

Prob(u=m)    =  (l-p(r       ,A)...(l-p(r^        ,A)>p(r^    ,A) 
i  m-1  m 

By   use   of  Theorem  1,    we    get 


ProKu=m)  <  (1  -^)'""^  ^ 
4ekv     2k.v 


If  u(e  )  is  the  least  number  such  that 
probfu  >  u   (e  )}  <  e  ,   then 


u(e  )  <  


4ekv 


Since   iQgCl  -  -,— , — )    >  -  Aekv  we  have  u(e)  <    4ekv    logf — J    . 

Each  allocation  part  of  a  resource's  round  takes  Z+u  steps.  So, 
it  is  enough  for  the  length  of  the  user's  round  to  be  equal  to  2(2-tv)+ 
1,    by    the  proof   of  Proposition  1.    This   implies    that 

duration      of      a  user's    round  <     (2(2-Hj)+l)r  which   implies   (for  \i 

independent    of    k,v) 


Prob{TA^k<    (2(2-tv)+l)r^^x  •    u(e  )}    >    l-e 


y^(c)    =0(pkv    log(i))   =  0(kv    log(i)) 


f,    =  0(kv)    . 


II. 2      The  case   of   tame  processes 

In    this    case   we   no    longer  assume  that   the   processes   are  equispeed, 

but   that  they  are   tame,    as   defined   in  Section   1.1.      Our  proof  parallels 

that  of   the  equispeed  case  of  the   previous  paragraph.      However,   in     the 

more     general  case   of   tame  processes,   we  have  to   set  B    =    2  ,    due  to 

'^min 
Proposition  3. 

In  Theorem   1,    the   constant      d      is     now     A  e.      A     proof      of      a 

generalization      of     Proposition      3  is   easy    to  derive  since    durations    of 

user's    rounds   are  equal   in   terms   of   number  of   steps   and   a  step      of      the 

slowest     process      can      correspond      to     at     most      ^max^^min  ^^^P^   of   the 

fastest   process.      Finally,    the  allocation  duration  now  can  be      at     most 


user's      round      to     be      (2(2-Hi)+l)      steps,      leading      to     at 

(2(2-Hi)+l)(r^ax/^min^   ^^^  ^°^  ^^  user's    round.     This   leads   to 


^2 
Y,  (e)    =  0(m    Jl^kv   log(i) 
^  ^min 


implying   again  f^  =  0(k.v) 
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